Text searching allowing for inversions and translocations of factors

نویسندگان

  • Domenico Cantone
  • Simone Faro
  • Emanuele Giaquinta
چکیده

The approximate string matching problem consists in finding all locations at which a pattern p of length m matches a substring of a text t of length n, after a finite number of given edit operations. In this paper,we investigate such a problemwhen the edit operations are translocations of adjacent factors of equal length and inversions of factors. In particular, we first present anO(nmmax(α, β))-time andO(m2)-space algorithm, where α and β are respectively the maximum lengths of the factors which can be involved in any translocation and inversion, and show that under the assumptions of equiprobability and independence of characters our algorithmhas aO(n logσ m) average time complexity, for an alphabet of size σ .We also present a very fast variant of a recently proposed algorithm for the same problem, based on an efficient filtering method, which has a O(n)-time complexity in the average case, though in the worst case it retains the same O(nmmax(α, β))-time complexity. © 2013 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

String Matching with Inversions and Translocations in Linear Average Time (Most of the Time)

We present an efficient algorithm for finding all approximate occurrences of a given pattern p of length m in a text t of length n allowing for translocations of equal length adjacent factors and inversions of factors. The algorithm is based on an efficient filtering method and has an O(nmmax(α, β))-time complexity in the worst case and O(max(α, β))space complexity, where α and β are respective...

متن کامل

Approximate String Matching Allowing for Inversions and Translocations

The approximate string matching problem consists in finding all locations at which a pattern P of length m matches a substring of a text T of length n, after a given finite number of edit operations. In this paper we investigate such problem when the string distance involves translocations of equal length adjacent factors and inversions of factors. In particular, we devise a O(nmmax(α, β))-time...

متن کامل

Chromosomal Analysis of Couples with Repeated Spontaneous Abortions in Northeastern Iran

Background Cytogenetic study of reproductive wastage is an important aspect in determining the genetic background of early embryogenesis. Approximately 15 to 20% of all pregnancies in humans are terminated as recurrent spontaneous abortions (RSAs). The aim of this study was to detect chromosome abnormalities in couples with RSAs and to compare our results with those reported previously. Materia...

متن کامل

Chromosomal distributions of breakpoints in cancer, infertility, and evolution.

We extract 11 genome-wide sets of breakpoint positions from databases on reciprocal translocations, inversions and deletions in neoplasms, reciprocal translocations and inversions in families carrying rearrangements and the human-mouse comparative map, and for each set of positions construct breakpoint distributions for the 44 autosomal arms. We identify and interpret four main types of distrib...

متن کامل

Efficient string-matching allowing for non-overlapping inversions

Inversions are a class of chromosomal mutations, widely regarded as one of the major mechanisms for reorganizing the genome. In this paper we present a new algorithm for the approximate string matching problem allowing for non-overlapping inversions which runs in O(nm) worst-case time and O(m2) space, for a character sequence of size n and pattern of size m. This improves upon a previous O(nm2)...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Discrete Applied Mathematics

دوره 163  شماره 

صفحات  -

تاریخ انتشار 2014